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In the social sciences, the debate over the structural foundations of social capital has long vacil- 
lated between two positions on the relative benefits associated with two types of social structures: 
closed structures, rich in third-party relationships; and open structures, rich in structural holes and 
brokerage opportunities. In this paper, we engage with this debate by focusing on the measures 
typically used for formalising the two conceptions of social capital: clustering and effective size. We 
show that these two measures are simply two sides of the same coin, as they can be expressed one in 
terms of the other through a simple functional relation. Building on this relation, we then attempt 
to reconcile closed and open structures by proposing a new measure, Simmelian brokerage, that 
captures opportunities of brokerage between otherwise disconnected cohesive groups of contacts. 
Implications of our findings for research on social capital and complex networks are discussed. 

PACS numbers: 89.75.Hc, 89.65.Ef, 87.23. Ge, 89.75.-k 



INTRODUCTION 

A fundamental idea in the social sciences is that social 
capital originates from social relations. As a result, so- 
cial structure has long been seen as playing a crucial role 
in sustaining or hindering a wide range of performance- 
related outcomes, both at the individual and collective 
levels [2^, 26, 37-3^]. However, while social scientists 
tend to agree on the salience of social structure, there is 
still controversy over which type of social structure mat- 
ters as a source of social capital. In this respect, two 
opposite types of structure are typically advocated: the 
"closed" and "open" structures. On the one hand, pro- 
ponents of the benefits of closed structures draw on the 
idea that social cohesion fosters trust [ll, 14, 47, 54 1 and 
a sense of belonging [l3| , sustains cooperative behaviour 
13l |32| and the enforcement of social norms 
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and facilitates the creation of a common culture |42| . On 
the other, advocates of the benefits of open structures 
emphasise the value that actors can extract from being 
located near structural holes separating non-redundant 
contacts, and thus from acting as brokers between other- 
wise disconnected others jfj H, 0, [3^, [f52| . 

This paper aims to draw on the interplay between 
these two alternative conceptions of social capital, and 
engage with the ongoing debate over the relative benefits 
associated with closed and open structures. The rela- 
tion between these two types of social structure will be 
explored through a comparative analysis of the meas- 
ures with which these structures have been tradition- 
ally opcrationaliscd and formalised: the clustering coef- 
ficient and the effective size of an actor's local neigh- 
bourhood. In particular, we will show that it is pos- 
sible to derive a simple mathematical relation between 
the clustering coefficient and the effective network size of 
a node. The existence of this relation between the two 



measures, which have been originally and independently 
introduced with the purpose of formalising two differ- 
ent concepts, supports the idea that social cohesion and 
structural holes are no more than the two sides of the 
same coin. Both measures can indeed be expressed in 
terms of number of links and number of triangles incid- 
ent upon a node. 

The question as to whether social capital stems from 
open or closed structures will probably always remain a 
matter of debate. There are cases where one is more 
interested in the benefits coming from closed structures 
231 ] , and other cases where, conversely, it is more conveni- 
ent to exploit the existence of open structures [52| . How- 
ever, our work highlights that, to characterise the local 
structural properties of a node, it is equally informative 
to measure the clustering coefficient or the effective size of 
the node's local neighbourhood. Drawing on this, we will 
then use the relation between clustering and effective size 
to develop a novel measure for a generative mechanism of 
social capital that lies at the interface between closed and 
open structures 0] : Simmelian brokerage. Being sensitive 
to variations in the position of links across local networks 
of the same density, this measure can capture opportun- 
ities of brokerage between otherwise disconnected groups 
of densely interconnected nodes HI, 33| • For this reason, 
Simmelian brokerage can be seen as suitable for form- 
alising structures that lie at the interface between the 
closed and open ones: it captures the extent to which a 
node's local network is characterised, on the one hand, 
by a combination of structural cleavages between distinct 
groups of contacts, and on the other by a closed cohesive 
structure within the boundaries of each group of contacts. 

The paper is organised as follows. In Section „ we in- 
troduce the concept of social capital, and offer a general 
overview of the two main theoretical conceptions of its 
structural foundations. In Section- we review the defin- 
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itions of clustering coefficient and network effective size. 
In Section n we show that the two measures are linked 
through a simple mathematical relation. Finally, based 
on this relation, in Section,, we introduce our new meas- 
ure of Simmelian brokerage. The last Section will sum- 
marise and discuss our main findings. 



STRUCTURAL FOUNDATIONS OF SOCIAL 
CAPITAL 

The premise that seems to underpin most perspectives 
on social capital is the idea that investments in social re- 
lations yield expected returns in the marketplace, includ- 
ing the community, the economic, financial, political, and 



labour markets [3JJ, [38j . As argued by Coleman, social 
capital can be characterised by two distinct properties: 
it "inheres in the structure of relations between actors 
and among actors", and "like other forms of capital, [it] 
is productive, making possible the achievement of cer- 
tain ends that in its absence would not be possible" 
p.S98]. 

Social scientists have long agreed on the salience of so- 
cial structure as a source of social capital (2iJ [26|, [37l - l39| . 
For instance, social structure has been seen as playing a 
pivotal role in sustaining individuals' and organisations' 

ll|, generate and trans- 
i3j, and enhance creativ- 



efforts to create value [25 
fer new knowledge Il2t \2 r i 

ity [l|,[^,0,|S|43,[5l[|53. As suggested by a number of 
scholars [26], |37| , three main explanations can be offered 
as to why social structure affects the outcomes of purpos- 
ive actions. First, social structure can facilitate or hinder 
the flow of information, and in so doing it also impacts on 
its quality [27, 47, 53, 55 1. Second, social structure can 
be seen as a source of reward and punishment due to the 
effects that social relations have on the internalisation 
and enforcement of social norms, including those against 
free-riding 24, 3^- Third, social structure nurtures and 
promotes the attainment of actors' trust, reputation, so- 
cial credentials, status, identity and recognition through 
processes of third-party referrals and reinforcement of in- 



teractions 

Despite the convergence on the explanatory relevance 
of social structure, however, there is still controversy and 
debate over the type of social structure that matters as 
a source of social capital [!, HI, 22, 37, 38 1. Theoretical 
conceptions of the structural foundations of social cap- 
ital vacillate between two positions that vary in their un- 
derstandings of the benefits associated with two oppos- 
ite types of social structure: "closed" and "open" struc- 
tures. Arguments in favour of each of these structures 
have been inspired by distinct rich traditions in sociolo- 
gical theory. Both arguments, however, are conceptually 
rooted in Simmel's seminal theoretical contributions on 
the expansion of a dyadic relationship into a three-party 
relationship (" Verbindung zu dreien"), and the sociolo- 



gical significance of the third element [50(. Simmel ar- 
gued that the introduction of a third party fundamentally 
changes the social dynamics of a dyadic tie: "The appear- 
ance of the third party indicates transition, conciliation, 
and abandonment of absolute contrast (although, on oc- 
casion, it introduces contrast)." [H^, p. 154]. Simmel's 
emphasis here is on the two alternative functional roles 
the third party can play in the triad: the "non-partisan' 
or mediator with the tertius iungens for "the third who 
joins") orientation on the one hand [43j, and the broker 
with the tertius gaudens (or "the third who enjoys") ori- 
entation on the other Q. 

Proponents of the benefits of closed structures typ- 
ically build on Simmel's [H(| tertius iungens logic and 
Coleman's 13J, [l4j conception of social capital prcdic- 
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ated on the mechanism of social cohesion 
the years, the Simmelian triad has provided the theor- 
etical backdrop against which scholars have investigated 
the relational hypothesis that actors separated by one 
intermediary are more likely to become connected with 
each other than actors that do not share any common 
13, E3, El, [H 0, [H, [H . At the macro 
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level of a social system, the tendency of actors to forge 
links locally within groups is conducive toward the cre- 
ation of cohesive social structures, organised into well- 
defined tightly knit communities that are densely connec- 
ted within but not across boundaries 0, [34| • One of the 
most influential theories of social capital, advocated by 
Coleman 13j , is predicated precisely on the benefits that 
actors accrue from being socially embedded within co- 
hesive social structures, rich in third-party relationships. 
Among the closure-based sources of social capital are 
normative control and deviance avoidance [si. WL 37. 381]. 
More generally, network closure enables the emergence 
and enforcement of social norms by encouraging the in- 
ternalisation of standards of acceptable behaviour and 
facilitating the detection and punishment of defective be- 
haviour [321 . [54j. In addition, it has been documented 



that being part of a close-k nit g roup engenders a sense of 
belonging 13| , fosters trust [llj, L 



14, 



471 [54j , facilitates the 



exchange of fine-grained, complex, tacit, and proprietary 
information 27, 541, enables the creation of a common 



culture and the emergence of a shared identi ty [421] , and 
helps sustain a high level of cooperation 1^, 32| . 

Despite the benefits associated with social cohesion, 
the tendency of individuals to cluster into densely con- 
nected communities also bears a two- fold cost: local re- 
dundancy and social pressure. On the one hand, the 
more an actor's contacts are connected with each other, 
the less likely they are to take the actor closer to diverse 
sources of knowledge and resources that the actor is not 
already able to access 25j. Paucity of connections with 
new and non-redundant social circles may create isolation 
and eventually degrade social capital. This is the cent- 
ral argument of Burt's [|| seminal contribution on the 
benefits associated with occupying brokerage positions 
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between otherwise disconnected individuals or groups in 
a network. On the other hand, above and beyond the re- 
dundancy of knowledge and resources, a cohesive struc- 
ture can still exert a negative influence on the connected 
actors as a result of the social pressure favouring conver- 
gent thinking and group consensus. As dense third-party 
relationships engender reciprocal behaviour and sustain 
high degrees of similarity among the actors, they are con- 
ducive toward the maintenance of the status quo rather 
than the exploration of novel paths leading to divergent 
solutions [19|, [51 1 . 



Both types of costs - redundancy and social pressure - 
associated with social cohesion have inspired an altern- 
ative conception of social capital, typically distilled into 
the proposition that there are benefits actors can extract 
from participating in open structures that are rich in 
cleavages and opportunities of brokerage d, H, B 0, [EH . 
At the heart of this conception of social capital lies Sim- 
mers fsoj ] characterisation of the role of tertius gaudens 
in a triad. While the non-partisan tertius iungens aims 
"to save the group unity from the danger of splitting up" , 
the tertius gaudens wishes to create or intensify discon- 
tinuities in the social structure by forging or preserving 
unique ties to disconnected others. 

The idea that social capital can originate from broker- 
age opportunities associated with structural gaps has 
been most thoroughly explored by Burt, who has perhaps 
contributed more than any other sociologist in recent dec- 
ades to examine the structural features and performance 
implications of brokerage, especially in organisational do- 
mains . Burt defines a structural hole as the "separa- 
tion between non-redundant contacts" , "a relationship of 
non-redundancy between two contacts" , "a buffer" that 
enables the two contacts to "provide network benefits 
that are in some degree additive rather than overlap- 
ping" 0, p. 18]. Burt further identifies two sources of the 
social capital that an actor can mobilise by acting as the 
broker between contacts at the opposite sides of the hole: 
information benefits and control benefits. On the one 
hand, information benefits originate from the fact that, 
in open structures rich in structural holes, connections 
tend to be weak [25j] and more likely to link people with 
different ideas, interests and perspectives 0. By gaining 
exposure to a greater variance and novelty of informa- 
tion, actors embedded in brokered structures will be more 
creative and successful in their endeavours 0, 0, l5l| . 
On the other, control benefits are related to the third 
party's ability to gain an advantage by negotiating his or 
her relationships with disconnected others and turning 
their "forces combined against him into action against 
one another." [EH P- 162]. Preserving and fostering 
disunion between parties thus enable the actor standing 
near a structural hole to extract social capital buried in 
the hole, by playing the disconnected parties' demands 
and preferences against one another. 



MEASURING SOCIAL COHESION AND 
STRUCTURAL HOLES 

If social cohesion and structural holes have long rep- 
resented two distinct conceptual pillars, each underlying 
one of the two opposing conceptions of social capital, they 
have also been formalised through two distinct, and in- 
dependently developed, measures: respectively, the clus- 
tering coefficient and the effective size of a node's local 
network. While clustering has been typically used for 
measuring the extent to which a node is socially embed- 
ded within a closed cohesive structure [ljl [H, 0, [EH , ef- 
fective size is a measure for detecting the non-redundancy 
of a node's contacts, and therefore the degree to which 
the node's local neighbourhood is rich in structural holes 
Q . The remaining of this Section is organised into two 
parts. Section n will review the definition of node clus- 
tering coefficient, and discuss an alternative measure for 
cohesion, node local efficiency. Section „ will be devoted 
to the formalisation of effective size. 



Clustering coefficient and local efficiency 

Let us consider an unweighted undirected graph 
G(V, L) with N = \V\ nodes and K = \L\ links, and let us 
focus on one of its nodes, node i, with i G {1,2,..., N}. 
In order to measure the local cohesion of node i, we define 
the subgraph Gi induced in G by the set A/i of the neigh- 
bours of i. The node clustering coefficient C\ of node i 
can then be defined as [581 y59f : 



K\Gi 



h(h - i)/2 




for k r > 2 
for hi = 0,1 



(1) 



where AT[G,] is the number of links in Gi. The node 
clustering coefficient d indicates the probability that two 
neighbours of node i are connected by a link, and is prop- 
erly normalised by definition such that < Cj < 1. In 
fact, the value of Ci in Equation ((T|) is the ratio between 
the actual number of links A'[G^] in the subgraph in- 
duced by the first neighbours of i and their maximum 
possible number, that is r?) = ki(ki — l)/2. Notice that 
A'[Gj] is also equal to the actual number of triangles con- 
taining node i, while ki(ki — l)/2 is the number of open 
triads centred on i, which corresponds to the maximum 
possible number of triangles containing a node i with hi 
links. Therefore, the node clustering coefficient d in 
Equation |T]) can be alternatively seen as the proportion 
of triads centred in i that close into triangles. 

As an example, let us consider the two graphs G a and 
G h shown in Figure Q] In both graphs, node i, coloured 
in yellow, has degree fej = 4, so that the sets Af? and Aj 
of the neighbours of i contain four nodes each. The four 
neighbours of i, labelled as nodes 1, 2, 3, 4, are shown as 
red filled circles, while the links connecting these nodes 
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to i are shown as dashed lines. The subgraph Gf in- 
duced in G a by the set AT® of the neighbours of i has four 
nodes and three links shown as solid lines. Similarly, the 
subgraph G\ induced in G b by the set J\f b contains four 
nodes and three links. Therefore, the clustering coeffi- 
cient of node i is equal to 1/2 in both cases. In fact, 
the four nodes of the induced graphs can be connected 
to each other through at most six links. In the figure, 
only three of these potential links are present. Hence, 
Ci = 3/6 = 1/2. Notice that, in general, the subgraph 
Gi of the neighbours of node i can be unconnected. For 
instance, this happens for the case of graph b) in Fig- 
ure!]] where node 4 is isolated in subgraph G\. However, 
this does not affect the mathematical definition of the 
clustering coefficient C,. What is instead problematic is 
a node i which is itself isolated in the graph, or has only 
one link. In this case, the ratio 2K[G i] / (ki(ki — 1)) is 
not defined. The usual convention, in this case, is to set 
Ci = when the degree of i is either zero or one, as 
reported in Equation (fT]). 

An important feature of the clustering coefficient of 
node i, as defined in Equation ([I]), is that it only depends 
on the number of links in the subgraph Gi, and not on 
which pairs of nodes are actually connected through such 
links in G,. Notice, for instance, that both Gf and G\ in 
Figure [1] have three links, but while Gf is a line of four 
nodes, G\ consists of a triangle and an isolated node. For 
this reason, here we discuss an alternative measure for the 
cohesion of the neighbourhood of a node, the node local 
efficiency, that enables the two cases shown in Figure Q] 
to be clearly differentiated. 

In an unweighted undirected graph G(V,L), the node 
local efficiency of node i is defined as the efficiency of the 
subgraph Gi 
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361 ] . where the efficiency of a graph is 
the average of the inverse of the distances between the 
nodes of the graph. Therefore, the node local efficiency, 
Ei, of node i can be written as: 



l_y y J_ 

i - 1) A ^ </,„ 



kiiki 1) 



(2) 



where E[Gi\ stands for the efficiency of graph Gi, while 
the quantities eg. m represent, respectively, the efficiency 
in the reachability between node I and node m, and are 
set equal to the inverse of the distance di m between the 
two nodes. Notice that the distances are evaluated on 
the graph G,;, and not on graph G. Moreover, the local 
efficiency is properly normalised by definition, such that 
< Ei < 1. Therefore, it takes values in the same range 
as the clustering coefficient. 

By making use of Equation @, it is possible to dis- 
tinguish between the roles that node i plays in the two 



graphs G a and G b in Figure [T] If we calculate the dis- 
tances between the four neighbours of i in Gf , we ob- 
tain: d\2 = (I23 = e?34 = 1, e?i3 = e?24 = 2 and c?i4 = 3. 
Hence, we have £ i2 = £23 = £34 = 1, £13 = £24 = 1/2, and 
£14 = 1/3, so that the local efficiency of node i in graph a) 
of Figure [1] is Ef = 13/18. Conversely, if we consider G b , 
we obtain: d\2 = ^23 = d\s = 1, du = ^24 = ^34 = 00. 
Thus, we have: £ i2 = £23 = £13 = 1, e i4 = £24 = £34 = 0, 
and in this case the local efficiency of node i is E\ = 1/2, 
which is smaller than Ef. Thus, even if node i has the 
same clustering coefficient in the two graphs, its local 
efficiency is different in the two cases. 

The mathematical definition of efficiency we have ad- 
opted implies that the efficiency of a graph with a fixed 
number of nodes increases by increasing the number of 
links. And for graphs with the same number of nodes and 
the same number of links, the efficiency depends on where 
the links are actually located in each graph. In particu- 
lar, the efficiency of a chain of three links connecting four 
nodes is higher than the efficiency of a triangle combined 
with an isolated node. The reason for this is that, in the 
latter case, the presence of an isolated node affects the 
overall reachability among nodes in the system: notwith- 
standing the presence of a triad of connected nodes, the 
isolated node is not reachable from the other ones. 



Effective size 

The original formalisation of the idea of structural 
holes was proposed by Burt for weighted graphs 
Therefore, here we will first begin our analysis of meas- 
ures for open structures with the most general case of 
a directed weighted graph G(V, L, W). We will then re- 
strict our focus to the particular case in which all weights 
are equal, since here our interest is on undirected and un- 
weighted graphs. 

Let us indicate as Wij > the entry of the 

weighted asymmetric matrix that describes the directed 
weighted graph G(V, L,W). As argued by Granovetter 
[25|, the weight of a link between any nodes i and j has 
the following meaning: a high (low) value of my indicates 
a large (small) amount of time, emotional intensity, in- 
timacy, and reciprocal services that characterise the link 
connecting node i to node j. Among the various measures 
introduced by Burt to detect and quantify the presence 
of structural holes, a key role is played by the effective 
size of a node's local network. 

The effective size of node z's network indicates the ex- 
tent to which each of the first neighbours of i is redund- 
ant with respect to the other neighbours, and can be ex- 
pressed in terms of the two following matrices: the trans- 
ition matrix P and the marginal strength matrix M . The 
entry pn of matrix P measures the proportion of i's net- 
work time and energy invested in the relationship with 





Figure 1. The clustering coefficient of a node, as defined in Ref. [59(], measures the cohesiveness of the node's neighbourhood. 
Despite the different configurations of links, the clustering coefficient of node i, measured as the fraction of links among i's 
neighbours over the total possible number of such links, is equal to d = 1/2 in both graphs. Conversely, the local efficiency of 
node i in graph a) is larger than in graph b). 



node t, and is defined as 
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wu 



Em(' 
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where wu + Wu is the sum of the weights of the two 
links connecting i to t, while ^2 m (wi m +w m i) is the total 
strength of node i. This is the sum of the out-strength, 



i , and the in-strcngth, s- n = 



Of 

i, i.e., the sum of the weights of all the incoming and 
outgoing links incident upon i. Notice that by definition 
< pu < 1 with pa = if there is neither a link 

from i to £, nor a link from £ to i. Also, the transition 
matrix P is stochastic: EiPu = 1- The entry rrijt of 
the second matrix, the marginal strength matrix M , is 
defined as: 



rriji 



i(Wj 



(4) 



Again, < rriji < 1 Wj, I, with = if there is neither 
a link from j to £, nor from £ to j. Notice that the two 
matrices P and M defined above are non-symmetric. 

According to the definition given by Burt, the effective 
size Si of node i's network reads Nlml: 



E 



Pitrriji 



(5) 



where Ni is the set of neighbours of i. Moreover, 1 < 
Si < hi Vi, that is the effective size of node i's network 
ranges from its smallest value equal to 1, when node i 
belongs to a clique, to a maximum value equal to the 
node degree fc, , when there are no links (j, £) connecting 
any two neighbours j and £ within i's network, i.e. when 
i is the centre of a star graph. In general, the more 
redundant the neighbours of i are, the smaller the value 
of Si is, and viccversa. 



The expressions above largely simplify in the case in 
which the graph is undirected and/or unweighted. In 
fact, when the graph is undirected, we have: wu = 
wu yi,£. In this case, the entries of the transition matrix 
P and of the marginal strength matrix M read, respect- 
ively: 



pa 



e out 



and 



rriji 



max™ Wj 



(6) 



(7) 



Furthermore, if the graph is also unweighted, we have: 
Wu = an Vi,£, where an is equal to 1 if there is a link 
between i and £, and to zero otherwise. This implies that 
max m Wj m = max m dj m = 1 for any node j that is not an 
isolated node. Consequently, the entries of the transition 
matrix P and of the marginal strength matrix M reduce 
to: 



Pi 



and 



Em a i 



m,j£ = cijg 



(8) 



(9) 



In this case, P is the transition probability of a random 
walk on the graph [ToT ] , while the marginal strength mat- 
rix M coincides with the adjacency matrix A. 

Let us now consider the following example to illustrate 
the meaning of Eq. ([5]). Figure [2] reports two undirected 
unweighted graphs, G a and G b , having the same num- 
ber of nodes, N = 8, and the same number of links, 
K = 13. In both graphs, node i is coloured in yellow and 
has ki = 5 links to its neighbours, shown as dashed lines. 
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The links connecting the neighbours of i are indicated 
by solid black lines. By visual inspection it can be easily 
noticed that the neighbourhood of i in graph G b contains 
more redundant links than in G a . Indeed, if we evalu- 
ate the effective size of node i in graph G a , we find that 
the contribution of three out of the five neighbours in i's 
network towards the summation in Equation ([5]) is equal 
to 1. This is because each of these three nodes in Af? 
has no links to the other neighbours of i, and therefore is 
non- redundant. The contribution of each of the other two 
remaining neighbours in 'i's network towards the summa- 
tion in Equation ([5]) is equal to 1 — l/k%. We finally have: 
Sf = 1 + 1 + 1 + (1- 1/5) + (1 - 1/5) = 3 + 8/5 = 23/5, 
which is a number larger than 3, but smaller than the 
actual degree ki = 5 of the node. If we calculate the 
effective size of node i in graph G b , we obtain: S b = 
(l-l/5)+(l-2/5)+(l-2/5)+(l-2/5)+(l-l/5) = 17/5. 
Because of the higher redundancy of the nodes in Af b , this 
value is smaller than the effective size of node i's network 
in graph G a . 



of length 3 connecting node i to node j. In particular, 
by setting i = j, the quantity J2i, m a ^ a £™ a ™.i denotes 
the number of closed walks of length 3 from node i to 
itself. This is twice the number of triangles containing 
node i. The generic triangle containing node i and the 
two nodes I and m is made of the two links connected 
to node i, namely and (to, i), and of the link (£,m) 
that belongs to Gi. Since the link (£, to) appears twice, 
namely in the closed walk (i,£,m,i) and in the closed 
walk (i, to, £, i), the number of links A'[G.;] is given by: 



K[Gi] = W 



(10) 



0, m 



Notice that this is the numerator of Equation ([IJ , so that 
we can express the local clustering coefficient of node i 



t °>ij a j£ a £i 
kiiki — 1) 




for k, > 2 
for ki = 0, 1 



(11) 



A SIMPLE RELATION BETWEEN CLUSTERING 
AND EFFECTIVE SIZE 

The two examples outlined in the previous Section 
already point to the existence of a relation between the 
clustering coefficient and the effective size of a node's 
network in an unweighted graph. In fact, the values of 
Cj and Si both depend on the number of triangles con- 
taining node i. The larger the number of triangles, i.e. 
of closed structures involving node i, the larger the clus- 
tering coefficient Ci . Conversely, the smaller the number 
of triangles, the larger the number of open structures 
that can be exploited by i, and thus the larger the ef- 
fective size of node i's network. Therefore, we expect an 
inverse relation between Cj and Sf. the larger the cluster- 
ing coefficient of a node, the smaller the effective size of 
the node's network. In this Section, we show that there 
is indeed an exact and simple relation between the two 
measures. Based on this relation, it can then be argued 
that it is not necessary to operationalise two distinct con- 
cepts (cohesion and structural holes) and use two differ- 
ent measures (clustering and effective size) to investigate 
two distinct sources of social capital, when one source 
can be measured simply in terms of the other. More 
specifically, either measure, together with the degree of 
the node, is sufficient to quantify both the local cohesion 
and the structural holes characterising the node's local 
network. 

We first notice that the definition of node clustering 
coefficient given in Equation (Q]) can be expressed in 
terms of the adjacency matrix of the graph. In fact, 
the number of links K [Gi] in graph Gi can be easily 
calculated from the adjacency matrix by observing that 
(A 3 )ij = J2i m a U a lm a mj is equal to the number of walks 



Let us now consider the effective size of node i's net- 
work. When the graph is undirected and unweighted, 
the entries of the transition matrix P and of the marginal 
strength matrix M read, respectively, as in Equations ([8]) 
and ©. Consequently, the effective size Si of node i's 
network can be written as: 



Si = ^2 a* 



Pumji 
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If we plug the expression for the clustering coefficient of 
node i in Equation (fTT|) into Equation (fT2"j) , we obtain: 



Si 



{h - l)Ci 



(13) 



This is an exact relation that connects three measures 
at the node level: effective size, clustering, and degree. 
For instance, we can use the relation to obtain the ef- 
fective size of a node's network by measuring the clus- 
tering coefficient of the node. Since for the two graphs 
in Figured] we have Cf = 1/10 and C\ = 4/10, by us- 
ing Equation [13] we obtain: 5? = 5 - 4/10 = 23/5 and 
S\ = 5 — 16/10 = 17/5, in perfect agreement with the 
values obtained by using Definition [5] More generally, 
Equation (|13|) provides a formalisation of the fact that 
structural holes and social cohesion are indeed the two 
faces of the same coin. The presence of structural holes, 




Figure 2. The effective size of a node's network, as defined in Ref. [(J, measures the lack of redundancy of the node's contacts. 
The neighbourhood of node i in the first graph is less redundant than the one in the second graph. Consequently, the effective 
size, <Sf = 23/5 = 4.6, of node i in graph G a is larger than the effective size, = 17/5 = 3.4, of node i in graph G b . 



measured by the effective size of a node's network, de- 
pends only on the degree of the node and on the so- 
cial cohesion of the node's local network, as measured by 
the node clustering coefficient. Conversely, the clustering 
coefficient of a node is uniquely determined by the degree 
of the node and by the effective size of its network. 

By definition, the effective size of a node's network 
can take values between zero and the degree of the node, 
while the clustering coefficient varies from zero to one. 
To make the two quantities comparable, we can normal- 
ise the definition of effective size. To this end, we can 
define the normalised effective size of node i's network, 
<Sj', dividing the effective size Si by its maximum possible 
value ki, namely: S^ = Si /ki. When both measures are 
normalised in [0, 1], their relation reads: 



ft,?, 



(14) 



which, for nodes with large degree, is well approximated 
by: 



5i ~ i - a 



(15) 



This equation indicates that the clustering coefficient and 
the normalised effective size are indeed two complement- 
ary measures that can be defined one in terms of the 
other. As a result, this relation cautions against using 
both measures simultaneously for detecting sources of so- 
cial capital. For instance, the inclusion of both clustering 
and effective size as covariates in a multivariate regres- 
sion model would inevitably entail problems of multicol- 
linearity due to the linear relation found between the two 
measures. 

Drawing on this relation, we can express in terms of 
effective size many of the results obtained for the cluster- 
ing coefficient in real networks. In particular, it has been 
found that in many networks the clustering coefficient of 



a node scales with the degree of the node as k~ u , where 
< w < 1 [5(| . This means that high-degree nodes 
tend to have a relatively smaller clustering coefficient 
than low-degree nodes. Consequently, in such networks 
effective size will increase with k, so that higher-degree 
nodes will exhibit a higher effective size than lower-degree 
nodes. The reason for this is precisely the mirror im- 
age of the argument typically proposed to explain the 
inverse relation between clustering and degree: any two 
neighbours of a large-degree node are more likely not 
to be directly connected with each other than any two 
neighbours of a low-degree node. Moreover, it is possible 
to use any model of random networks with a tunable 
degree-dependent clustering coefficient 0, 46, 4^, 57 to 
construct a random network with a fixed distribution of 
effective size. In turn, such random network can be used 
as a null model to assess the statistical significance of the 
measured correlation, at the node level, between effective 
size on the one hand, and other structural properties or 
performance-related outcomes, on the other. 



RECONCILING SOCIAL COHESION AND 
STRUCTURAL HOLES: SIMMELIAN 
BROKERAGE 

As discussed in Section,, local efficiency is a generalisa- 
tion of the node clustering coefficient, in that it measures 
the extent to which the neighbours of node i would reach 
each other if i were removed from the network. Unlike 
the clustering coefficient, local efficiency allows us to dis- 
tinguish between cases where the subgraphs Gi have the 
same number of nodes and the same number of links, but 
are topologically different, as was the case of the two ex- 
amples in Figure [1] In this Section, we show that local 
efficiency can be effectively employed to develop a new 
measure for brokerage that lies at the interface between 
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clustering and effective size. Like clustering and effective 
size, this measure will be sensitive to structural gaps in 
a node's local network. However, unlike clustering and 
effective size, it will also be sensitive to variations in the 
position of links across local networks of the same dens- 
ity. For this reason, the measure is capable of capturing 
brokerage opportunities among otherwise disconnected 
socially cohesive groups of nodes. 

We begin by observing that, if the local efficiency of 
node i is small, then i plays an important role in enabling 
and facilitating reachability among its neighbours. In 
this case, i acts as a broker among its neighbours, since 
the removal of i would disconnect many pairs of nodes 
in i's neighbourhood, or would inevitably deteriorate the 
ability of these nodes to reach each other. Conversely, 
if the local efficiency of i is large, then the nodes in i's 
neighbourhood would still be able to reach each other 
even without the intermediary role of i, and as a res- 
ult they would barely be affected by i's removal. In this 
case, i plays a negligible brokerage role. More gener- 
ally, the higher the local efficiency of a node, the fewer 
the opportunities a node has to act as a broker, and vice 
versa. Based on this observation, on the relation in Equa- 
tion (fl"3|) . and on the fact that the clustering coefficient 
and the normalised effective size range in the same in- 
terval [0, 1], here we introduce the following measure for 
local brokerage Bi of node i: 



Bi = h - (h -l)Ei 



(16) 



In qualitative agreement with Krackhardt's |33j idea of 
Simmelian ties as ties embedded in cliques, we propose 
to call this measure Simmelian brokerage. Our choice is 
motivated by the fact that the measure is indeed sensitive 
to the extent to which a node acts as a broker between 
Simmelian ties or, alternatively, between otherwise dis- 
connected groups of densely connected nodes. This is the 
case of a node that is a member of different cliques, and 
thus acts as the intermediary between two or more dis- 
connected sets of Simmelian ties, rich in third-party rela- 
tionships HI HI ■ The definition of Simmelian brokerage 
is similar to that of effective size, with the only difference 
that the clustering coefficient d of node i is replaced by 
the node local efficiency Ei . According to Equation (fT"6| , 
when the degree of a node is fixed, an increase in the 
value of local efficiency corresponds to a decrease in the 
value of Simmelian brokerage, and vice versa. 

To shed light on the relation between effective size and 
Simmelian brokerage, we now discuss a number of ex- 
amples of brokerage opportunities in unweighted undir- 
ected graphs. Figure [3] shows six graphs, each with N = 9 
nodes, but with a different number and configuration of 
links. A first inspection of the figure makes it immedi- 
ately clear that the central node i, indicated in yellow, 
has a different brokerage role in each of the six graphs. 
Graph a) is a clique, i.e. a complete graph in which each 
node has a link to each of the other nodes. This structure 



is characterised by high redundancy - indeed the max- 
imum possible redundancy among all the graphs with the 
same number of nodes - due to the presence of the max- 
imum possible number of links in the graph. In this case, 
the local efficiency of node i is equal to Ef = 1.0, and 
for Simmelian brokerage we obtain the smallest possible 
value, Bf = 8 — 7 x 1.0 = 1.0. Such a small value is 
consistent with the relatively negligible role that i has in 
facilitating reachability among its neighbours: if i were 
removed from the graph, not only would its neighbour- 
hood remain connected, but each of its contacts would 
still have a direct connection with each other. In the 
overall economy of the network, node i is thus a redund- 
ant contact. 

Graph b) is a wheel graph, where the neighbours of 
node i are arranged in a cycle. As in the case of a clique, 
here the neighbourhood of node i remains connected even 
when i is removed from the graph, so that i can be con- 
sidered somehow redundant. However, while some pairs 
of nodes in the induced graph have distance equal to 1, 
the majority of pairs of nodes are at distance 2, 3 of 4, 
so that the corresponding local efficiency is E\ ~ 0.559 
and node i's Simmelian brokerage is equal to B\ ~ 4.083. 
The relatively higher value of i's Simmelian brokerage in 
graph b) than in graph a) reflects the more central role 
played by i in b) than in a) in facilitating reachability 
among its neighbours. This is also reinforced by the fact 
that, when i is removed from the graph, the average dis- 
tance among its neighbours increases from ~ 1.714 (in 
graph a)) to ~ 2.286 (in graph b)). 

From a visual inspection, we would also expect the 
Simmelian brokerage of node i to increase from graph b) 
to graph c) due to an increase in structural gaps between 
distinct groups of nodes. In graph c), node i does not 
only intermediate between already connected nodes, but 
it brokers between different groups of nodes that would 
otherwise remain disconnected. In this case, the local 
efficiency of i is E\ ~ 0.309 and, as expected, the corres- 
ponding Simmelian brokerage of i is higher than in graph 
b), and is equal to B\ ~ 5.83. 

Graphs d) and e) have the same number of links K = 
4, but with a different configuration. Interestingly, the 
effective size of node i's network in the two graphs is 
the same, (Si = 7.0), since in both graphs the clustering 
coefficient of i is equal to Cj = 1/7. However, a removal 
of i causes a more significant damage in graph c) than in 
graph d), since the intermediary role of i is more crucial 
in e) than in d). Moreover, in graph e) node i is affiliated 
with four distinct cohesive groups of connected contacts, 
whereas in graph d) node i intermediates between three 
isolated contacts and one loosely connected group. In 
this sense, in graph e) node i spans more structural holes 
between Simmelian tics than in graph d) . Our measure of 
Simmelian brokerage does indeed capture this difference. 
In fact, if we removed i from graph d), then 10 out of 28 
pairs among i's neighbours (namely all pairs involving 
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Figure 3. The extent to which a node acts as a Simmelian broker depends on the number and configuration of links in its 
neighbourhood. The six graphs correspond to six values of Bi, in increasing order: a) 1.0, b) 4.08, c) 5.83 d) 6.46, e) 7.0 f) 8.0. 
When the neighbourhood of i is a clique (graph a)), the Simmelian brokerage of i is negligible, and i is practically superfluous. 
As soon as the neighbourhood becomes sparser and more structural holes appear, node i acquires higher values of Simmelian 
brokerage. In graphs d) and e), node i has the same clustering coefficient (and thus the same effective size), but the value of 
Simmelian brokerage of i is higher in e) than in d) because in e) i intermediates among four cohesive groups of nodes, whereas 
in d) i intermediates between one group of four loosely connected nodes and three otherwise isolated nodes. Opportunities of 
brokerage are maximised in the limiting case of a star graph in which node i intermediates among eight otherwise disconnected 
contacts (graph f)). 



the five nodes in the group) would still remain reachable. 
In this case, the value of Simmelian brokerage of i is 
Bf ~ 6.46. Conversely, the removal of i in e) would 
produce a more serious damage to the network, since only 
4 pairs of neighbours of i over 28 would remain reachable. 
In this case, the value of Simmelian brokerage is equal to 
B% = 7.0. 

Finally, graph f) is a star, so that by removing node 
i no pair of its neighbours would remain reachable any 
longer. In this limiting case, the value of local efficiency 
of node i is equal to zero, and the one of Simmelian 
brokerage, as well as of effective size, is equal to the node 
degree, namely B[ = 8.0. In general, if two nodes have 
the same local efficiency, the one having the higher degree 
has a higher Simmelian brokerage. This is due to the fact 
that the removal of a high-degree node could potentially 
leave a higher number of pairs of nodes disconnected, and 
could therefore cause a more substantial damage to the 
network than the removal of a node characterised by a 
relatively low degree. 



CONCLUSIONS 

Graphs are an invaluable mathematical tool for ex- 
amining the topology and evolution of social structures, 
and graph measures have contributed to the operation- 
alisation and formalisation of fundamental sociological 
concepts as well as to the development of social theor- 
ies. Among these measures, clustering and effective size 
have played a pivotal role in the debate that, over the 
last few decades, has been concerned with the types of 
social structures that matter as sources of social capital 
H E 22, 37, [38 1 . In this paper, our contribution to 
this debate began by reviewing the two measures, clus- 
tering and effective size, typically associated with two 
opposing types of social structure, the closed and open 
structure respectively. We then clarified the relationship 
between these two measures, and found that they are in- 
deed connected through a simple mathematical relation. 
While so far the two measures have been related to each 
other primarily at a conceptual level and on intuitive 
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grounds [6j, in this paper we provided a formal frame- 
work in which one measure can be expressed in terms of 
other and vice versa. 

The study of formal relations between different graph 
measures can help unveil the intimate connections 
between already existing, and apparently unrelated, so- 
ciological concepts and, at times, even lead to the de- 
velopment of new concepts and measures. This indeed 
describes the trajectory that brought us from a more 
thorough understanding of the relation between closed 
and open structures to the proposal of a new measure 
that captures a topological configuration at the interface 
between the two types of structure. The idea was to 
identify brokerage positions in which a node can inter- 
mediate between otherwise disconnected cohesive groups 
of contacts. In such cases, the node's local network can 
be seen as both open and closed: open in that it is rich 
in structural holes separating distinct groups of contacts; 
and closed in that it is at the same time rich in third- 
party relationships within each of the groups with which 
the node is affiliated. 

In qualitative agreement with the organisational lit- 
erature on Simmelian ties 0, [33| , we proposed to call 
Simmclian brokerage the new measure for detecting such 
structural positions. Simmelian brokerage helps differen- 
tiate between brokerage positions of nodes with the same 
degree and the same local clustering coefficient, but with 
a different configuration of links in their local neighbour- 
hoods. In those cases, effective size would also remain 
unchanged as, all else being equal, it is not sensitive to 
variations in the positions of links. However, brokerage 
opportunities are likely to differ when, simply by reshuff- 
ling the same number of links across a node's local neigh- 
bourhood, there is a variation in the number of socially 
cohesive groups with which the node is affiliated. Sim- 
melian brokerage, unlike effective size, is sensitive pre- 
cisely to these variations in group affiliation that result 
from a change in the position of links. 

Our findings can nourish the theoretical debate about 
the relative salience of closed and open structures for so- 
cial capital, and will inform further research on the gen- 
erative mechanisms of social capital. On the one hand, 
empirical tests of the relative benefits of closed and open 
structures will now find in our proposed relation between 
clustering and effective size a sound argument safeguard- 
ing against problems of multicollinearity, typically arising 
as a result of the simultaneous inclusion of both meas- 
ures as explanatory variables in multivariate regression 
models. On the other, future research on the relative 
benefits of cohesive and brokered networks will benefit 
from the application of Simmelian brokerage to a num- 
ber of empirical domains. In this sense, our study will 
help reconcile the apparently opposing results that vari- 
ous strands of literature have uncovered on the structural 
foundations of social capital [l , 1{J 

mm, Hunzt 

Simmelian brokerage, as a new topological measure of 



network structure, can also spur a wealth of research 
broadly concerned with the topology and dynamics of 
complex networks. For the sake of simplicity, in this pa- 
per we have restricted our focus only to the case of un- 
weighted networks. However, the relation between clus- 
tering coefficient and effective size can be easily general- 
ised to the case of weighted graphs, which will in turn en- 
able Simmclian brokerage to be also extended to weighted 
graphs. 

More generally, the main implication of our study for 
research on complex networked systems lies in the change 
of perspective entailed by our emphasis on structural 
cleavages, as opposed to ties, that we borrowed from the 
burgeoning network literature in the social sciences. In 
this sense, our study may suggest a number of possible 
and previously neglected ways in which, simply by de- 
flecting attention from the presence to the absence of a 
tie, new insights can be gained on the organisation, func- 
tioning and dynamics of a variety of systems. 
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